Method and system for processing an image featuring multiple scales

ABSTRACT

A method of processing an image is disclosed. The method comprises obtaining an image decomposed into a set of scaled images, each being characterized by a different image-scale; and calculating, for each of at least some scaled images, a relative luminance between the scaled image and another scaled image of the set, using intensities in the scaled image and intensities in the another scaled image. The method further comprises processing each scaled image using an adaptation procedure featuring an image-specific effective saturation function of the relative luminance, thereby providing a processed scaled image; combining at least some of the processed scaled images to provide a combined image; and outputting the combined image to a computer readable medium.

RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 13/759,155 filed on Feb. 5, 2013, which is a Continuation-in-Part of PCT Patent Application No. PCT/IL2011/000639 having International Filing Date of Aug. 4, 2011, which claims the benefit of priority under 35 USC §119(e) of U.S. Provisional Patent Application No. 61/370,812 filed on Aug. 5, 2010. The contents of the above applications are all incorporated by reference as if fully set forth herein in their entirety.

FIELD AND BACKGROUND OF THE INVENTION

The present invention, in some embodiments thereof, relates to image processing. Specifically, the present embodiments can be used for providing an automatic dynamic range modulation of a digital image. In various exemplary embodiments of the invention the method and/or apparatus is used for companding (compressing and expanding) a high dynamic range (HDR) image. The present embodiments further comprise an imaging system.

High dynamic range imaging (HDRI) is a set of techniques that allow a far greater dynamic range of exposures (large difference between light and dark areas) than normal digital imaging techniques. The intention of HDRI is to accurately represent the wide range of intensity levels found in real scenes ranging from direct sunlight to the deepest shadows.

HDRI was originally developed for use with purely computer-generated images. Later, methods were developed to produce a HDR image from a set of photos taken with a range of exposures. With the rising popularity of digital cameras and easy to use desktop software, many amateur photographers have used HDRI methods to create photos of scenes with a high dynamic range.

HDR images require a higher number of bits per color channel than traditional images, both because of the linear encoding and because they need to represent values from 10⁻⁴ to 10⁸ (the range of visible luminance values) or more. 16-bit (“half precision”) or 32-bit floating point numbers are often used to represent HDR pixels. However, when the appropriate transfer function is used, HDR pixels for some applications can be represented with as few as 10-12 bits for luminance and 8 bits for chrominance without introducing any visible quantization artifacts.

Digital images may contain a huge amount of data, especially for high quality display and printing. Commercially available digital imaging devices are known to acquire image information across a wide dynamic range of several orders of magnitude. Additionally, there are software solutions which fuse multiple exposures of the same scene at lower dynamic range into one image of higher dynamic range.

Typically, although at the time of image capture the acquired dynamic range is rather large, a substantial portion of it is lost once the image is digitized, printed or displayed. For example, most images are digitized to 8-bits (256 levels) per color-band, i.e., a dynamic range of about two orders of magnitude. The problem is aggravated once the image is transferred to a display or a print medium which is often limited to about 50 levels per color-band.

International Publication No. WO2009/081394, the contents of which are hereby incorporated by reference discloses an image processing technique in which a digital HDR image is processed using two adaptation procedures employed on the achromatic channel of the digital image. Each adaptation procedure incorporates a different effective saturation function of the intensity. The adaptation procedures mimic the operation of the physiological visual system, wherein the first procedure mimics the “on” retinal pathway and the second adaptation procedure mimics the “off” retinal pathways. The intensity level of each picture-element of the digital image is processed by both procedures. The result of each processing is an intermediate intensity level. All the intermediate intensity levels of the picture-element are then combined to provide a new achromatic intensity.

SUMMARY OF THE INVENTION

According to an aspect of some embodiments of the present invention there is provided a method of processing an image. The method comprises: obtaining an image decomposed into a set of scaled images, each being characterized by a different image-scale; processing each scaled image of the set using an adaptation procedure featuring an image-specific effective saturation function of intensities in the scaled image and intensities in another scaled image of the set, thereby providing a processed scaled image; combining at least some of the processed scaled images to provide a combined image; and outputting the combined image to a computer readable medium.

According to some embodiments of the present invention the method comprises, for each of at least some scaled image of the set, calculating a relative luminance between the scaled image and another scaled image of the set, using intensities in the scaled image and intensities in the another scaled image. According to some embodiments of the present invention the image-specific effective saturation function is a function of the relative luminance.

According to some embodiments of the present invention the method comprises receiving the image and decomposing the image into the set of scaled images.

According to some embodiments of the invention the decomposing comprises selecting a size of the set based on a size of the image.

According to some embodiments of the invention the decomposing comprises determining an amount of information in each scaled image being formed, and ceasing the decomposing when the amount of information is below a predetermined threshold.

According to an aspect of some embodiments of the present invention there is provided a method of capturing and displaying an image. The method comprises capturing an image of a scene and processing the image using the method as delineated above.

According to some embodiments of the present invention the method comprises recording radiation selected from the group consisting of visible light, infrared light, ultraviolet light, X-ray radiation, radiofrequency radiation, microwave radiation and ultrasound radiation, thereby capturing the image.

According to an aspect of some embodiments of the present invention there is provided a computer software product, comprising a computer-readable medium in which program instructions are stored, which instructions, when read by a computer, cause the computer to execute the method as delineated above.

According to an aspect of some embodiments of the present invention there is provided a system for processing an image, the system comprises a data processor configured for: decomposing the image into a set of scaled images, each being characterized by a different image-scale; processing each scaled image of the set using an adaptation procedure featuring an image-specific effective saturation function of intensities in the scaled image and intensities in another scaled image of the set, thereby providing a processed scaled image; and combining at least some of the processed scaled images to provide a combined image.

According to some embodiments of the present invention the data processor calculates, for each scaled image of the set, a relative luminance between the scaled image and another scaled image of the set using intensities in the scaled image and intensities in the another scaled image. According to some embodiments of the present invention the image-specific effective saturation function is a function of the relative luminance.

According to an aspect of some embodiments of the present invention there is provided an imaging system. The imaging system comprises an image capturing system and the processing system as delineated above.

According to some embodiments of the present invention the image capturing system is selected from the group consisting of a digital camera, a video camera, a CMOS digital camera, an infrared camera, an X-ray camera, a scanner, a microwave imaging, a computerized tomography scanner, a magnetic resonance imaging scanner, a mammography scanner, an ultrasonic scanner, an impedance imaging system, an endoscopic imaging device, a radio telescope, a digital telescope, a digital microscope and a system for translating an analog image to a digital image.

According to some embodiments of the present invention a characteristic dynamic range of the combined image is lower than a characteristic dynamic range of the original image.

According to some embodiments of the present invention the scaled images are combined by multiplication.

According to some embodiments of the present invention the set is an ordered set and wherein the relative luminance is expressed as function of a ratio between the intensities in the scaled image and the intensities in the other scaled image.

According to some embodiments of the present invention the image-specific effective saturation function comprises an image-specific exponent, which is a function of a local contrast within the scale-image.

According to some embodiments of the present invention the processing comprises modulating each relative luminance to provide a plurality of modulated relative luminance levels, wherein the combining comprises combining the modulated relative luminance levels.

According to some embodiments of the present invention the modulating comprises selecting a relative luminance level such that two effective saturation functions corresponding to different image-specific exponent but the same scale are substantially matched, more preferably substantially equal.

According to some embodiments of the present invention the local contrast is calculated using a contrast-based adaptation procedure employed for each picture-element of the scaled image.

According to some embodiments of the present invention the contrast-based adaptation procedure calculates the local contrast based on a difference between a second order opponent receptive field function calculated for the picture-element and a second order opponent receptive field function calculated for nearby picture-elements.

According to some embodiments of the present invention the image-specific exponent is a decreasing function of the local contrast.

According to some embodiments of the present invention the image-specific exponent is a linear decreasing function of the local contrast.

According to some embodiments of the present invention the image-specific effective saturation function comprises a modulation function which is calculated based on a local contrast.

According to some embodiments of the present invention the modulation function has higher values when the local contrast is low, and lower values when the local contrast is high.

According to some embodiments of the present invention the method comprises employing a global gain operation for all scaled images of the set.

According to some embodiments of the present invention the global gain operation features a global gain exponent, and the method comprises calculating the global gain exponent using an optimization procedure.

According to some embodiments of the present invention the optimization procedure comprises selecting a set of candidate gain exponents, assigning a score to each candidate gain exponent, and selecting the gain exponent responsively to the score.

According to some embodiments of the present invention the score comprises a characteristic contrast.

According to some embodiments of the present invention the set is an ordered set and wherein the scaled image and the other scaled image are adjacent images in the set.

According to some embodiments of the present invention the image is an HDR image.

According to some embodiments of the present invention the image is of at least one type selected from the group consisting of a visible light image, a stills image, a video image, an X-ray image, an infrared image, a thermal image, a ultraviolet image, a computerized tomography (CT) image, a mammography image, a Roentgen image, a positron emission tomography (PET) image, a magnetic resonance image, an ultrasound images, an impedance image, an elastography image, and a single photon emission computed tomography (SPECT) image.

Unless otherwise defined, all technical and/or scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of embodiments of the invention, exemplary methods and/or materials are described below. In case of conflict, the patent specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and are not intended to be necessarily limiting.

Implementation of the method and/or system of embodiments of the invention can involve performing or completing selected tasks manually, automatically, or a combination thereof. Moreover, according to actual instrumentation and equipment of embodiments of the method and/or system of the invention, several selected tasks could be implemented by hardware, by software or by firmware or by a combination thereof using an operating system.

For example, hardware for performing selected tasks according to embodiments of the invention could be implemented as a chip or a circuit. As software, selected tasks according to embodiments of the invention could be implemented as a plurality of software instructions being executed by a computer using any suitable operating system. In an exemplary embodiment of the invention, one or more tasks according to exemplary embodiments of method and/or system as described herein are performed by a data processor, such as a computing platform for executing a plurality of instructions. Optionally, the data processor includes a volatile memory for storing instructions and/or data and/or a non-volatile storage, for example, a magnetic hard-disk and/or removable media, for storing instructions and/or data. Optionally, a network connection is provided as well. A display and/or a user input device such as a keyboard or mouse are optionally provided as well.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING(S)

The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.

Some embodiments of the invention are herein described, by way of example only, with reference to the accompanying drawings and images. With specific reference now to the drawings in detail, it is stressed that the particulars shown are by way of example and for purposes of illustrative discussion of embodiments of the invention. In this regard, the description taken with the drawings makes apparent to those skilled in the art how embodiments of the invention may be practiced.

In the drawings:

FIG. 1 is a flowchart diagram illustrating a method suitable for processing an image, according to some embodiments of the present invention;

FIGS. 2A and 2B are plots of processed intensities R as a function of a relative luminance q, according to some embodiments of the present invention;

FIGS. 3A and 3B are schematic illustrations of rectangular grids of picture-elements which exemplify a concept of picture-element regions, according to various exemplary embodiments of the invention;

FIG. 4 is a schematic illustration of a system for processing an image, according to some embodiments of the present invention;

FIG. 5 is a schematic illustration of an imaging system, according to some embodiments of the present invention;

FIGS. 6A and 6B show a thermal image before (6A) and after (6B) processing according to some embodiments of the present invention;

FIGS. 7A and 7B show another thermal image before (7A) and after (7B) processing according to some embodiments of the present invention;

FIGS. 8A and 8B show another thermal image before (8A) and after (8B) processing according to some embodiments of the present invention;

FIGS. 9A and 9B show another thermal image before (9A) and after (9B) processing according to some embodiments of the present invention;

FIGS. 10A and 10B show another thermal image before (10A) and after (10B) processing according to some embodiments of the present invention;

FIGS. 11A and 11B show another thermal image before (11A) and after (11B) processing according to some embodiments of the present invention;

FIGS. 12A-D show high dynamic range images, processed according to some embodiments of the present invention;

FIGS. 13A-F show a Gaussian pyramid obtained according to some embodiments of the present invention for a set of six scaled images;

FIGS. 14A-E show a luminance pyramid obtained according to some embodiments of the present invention from the Gaussian pyramid of FIGS. 13A-F;

FIGS. 15A-E show a local contrast pyramid obtained according to some embodiments of the present invention from the luminance pyramid of FIGS. 14A-E;

FIGS. 16A-E show a representation pyramid corresponding to a set of exponents obtained according to some embodiments of the present invention from the local contrast pyramid of FIGS. 15A-E;

FIGS. 17A-E show a saturation pyramid obtained according to some embodiments of the present invention from the exponents corresponding to the pyramid of FIGS. 16A-E; and

FIGS. 18A and 18B show an input image and an image processed according to some embodiments of the present invention using the pyramids of FIGS. 13A-17E.

DESCRIPTION OF SPECIFIC EMBODIMENTS OF THE INVENTION

The present invention, in some embodiments thereof, relates to image processing. Specifically, the present embodiments can be used for providing an automatic dynamic range modulation of a digital image. In various exemplary embodiments of the invention the method and/or apparatus is used for companding (compressing and expanding) a high dynamic range (HDR) image. The present embodiments further comprise an imaging system.

Before explaining at least one embodiment of the invention in detail, it is to be understood that the invention is not necessarily limited in its application to the details of construction and the arrangement of the components and/or methods set forth in the following description and/or illustrated in the drawings and/or the Examples. The invention is capable of other embodiments or of being practiced or carried out in various ways.

The present embodiments are concerned with method and system for processing an image to facilitate its display. At least part of the processing can be implemented by a data processing system, e.g., a dedicated circuitry or a general purpose computer, configured for receiving the image and executing the operations described below.

The method of the present embodiments can be embodied in many forms. For example, it can be embodied in on a tangible medium such as a computer for performing the method operations. It can be embodied on a computer readable medium, comprising computer readable instructions for carrying out the method operations. In can also be embodied in electronic device having digital computer capabilities arranged to run the computer program on the tangible medium or execute the instruction on a computer readable medium.

Computer programs implementing the method of the present embodiments can commonly be distributed to users on a distribution medium such as, but not limited to, a floppy disk, a CD-ROM, a flash memory device and a portable hard drive. From the distribution medium, the computer programs can be copied to a hard disk or a similar intermediate storage medium. The computer programs can be run by loading the computer instructions either from their distribution medium or their intermediate storage medium into the execution memory of the computer, configuring the computer to act in accordance with the method of this invention. All these operations are well-known to those skilled in the art of computer systems.

The image to be analyzed using the teachings of the present embodiments is generally in the form of imagery data arranged gridwise in a plurality of picture-elements (e.g., pixels, group of pixels, etc.).

The term “pixel” is sometimes abbreviated herein to indicate a picture-element. However, this is not intended to limit the meaning of the term “picture-element” which refers to a unit of the composition of an image.

References to an “image” herein are, inter alia, references to values at picture-elements treated collectively as an array. Thus, the term “image” as used herein also encompasses a mathematical object which does not necessarily correspond to a physical object. The original and processed images certainly do correspond to physical objects which are the scene from which the imaging data are acquired.

Each pixel in the image can be associated with a single digital intensity value, in which case the image is a grayscale image. Alternatively, each pixel is associated with three or more digital intensity values sampling the amount of light at three or more different color channels (e.g., red, green and blue) in which case the image is a color image. Also contemplated are images in which each pixel is associated with a mantissa for each color channels and a common exponent (e.g., the so-called RGBE format). Such images are known as “high dynamic range” images.

The input image can be provided by any imaging modality, including, without limitation, a digital camera, a video camera, a CMOS digital camera, an infrared camera, a thermography device, an X-ray camera, a scanner, a microwave imaging, a computerized tomography scanner, a single photon emission computed tomography device, a magnetic resonance imaging scanner, a mammography scanner, an ultrasonic scanner, an impedance imaging system, an endoscopic imaging device, an elastography device, a radio telescope, a digital telescope, a digital microscope and a system for translating an analog image to a digital image.

Commercially available digital imaging devices based upon CCD detector arrays are known to acquire image information across a wide dynamic range of the order of 2 to 3 orders of magnitude. It is expected that with the rapid technologically development in the field of digital imaging, this range will most likely be broadened in the near future. Typically however, although at the time of image capture the acquired dynamic range is rather large, a substantial portion of it is lost once the image is digitized, printed or displayed. For example, most images are digitized to 8-bits (256 levels) per color-band, i.e., a dynamic range of about two orders of magnitude. The problem is aggravated once the image is transferred to a display or a print medium which is often limited to about 50 levels per color-band.

A novel imaging technology, recently developed, employs CMOS with active pixel sensors [O. Yadid-Pecht and E. Fossum, “Image Sensor With Ultra-High-Linear-Dynamic Range Utilizing Dual Output CMOS Active Pixel Sensors”, IEEE Trans. Elec. Dev., Special issue on solid state image sensors, Vol. 44, No. 10, 1721-1724], which are capable of locally adjusting the dynamical range, hence to provide a high quality image with high dynamic range.

In addition, over the past years software solutions were developed for fuse multiple exposures of the same scene at low dynamic range (e.g., 256 levels per color-band) into one high dynamic range image (of about 4 orders of magnitudes). High dynamic range images are typically provided in an RGBE format. In this format, 4 bytes are used (as opposed to 3 bytes in conventional images) to create a representation similar to floating point, where the first three bytes represent the three RGB color channels and the forth byte represents a common exponent to the three colors channels. The dynamic range of such images is about 4 orders of magnitude.

The motivation for developing imaging devices capable of capturing high dynamic range images is explained by the enormous gap between the performances of the presently available devices and the ability of the human visual system to acquire detailed information from an ultra-high dynamic range scene. Specifically, the human visual system, which is capable of acquiring a dynamic range of 14 orders of magnitude, can easily recognize objects in natural light having a dynamic range of 12 orders of magnitude.

Still, there is a growing gap between the state-of-the-art imaging devices and display devices. High quality images, obtained either with photographical film or by digital cameras, suffer, once displayed on a screen or printed as a hard copy from loss in clarity of details and colors at extreme light intensities, within shadows, dark regions, extremely bright regions and/or surfaces close to a lightening source. For example, as a single sharp edge in natural scene (e.g., a shaded object in illuminated scene) can reach a dynamic range of 2 orders of magnitudes, presently available display devices may not be able to recognize such an edge. Another severe problem is that in a specific exposure a dark region of the image may be seen while a bright region is over exposed, or vise versa.

The technique developed by the present inventor is suitable for HDR images as well as other images.

Referring now to the drawings, FIG. 1 is a flowchart diagram illustrating a method suitable for processing an image, according to some embodiments of the present invention. The method of the present embodiments can be used for processing any image including, without limitation, a visible light image, a stills image, a video image, an X-ray image, a thermal image, a ultraviolet image, a computerized tomography (CT) image, a mammography image, a Roentgen image, a positron emission tomography (PET) image, a magnetic resonance image, an ultrasound images, an impedance image, and a single photon emission computed tomography (SPECT) image.

The method begins at 10 and optionally continues to 11 at an image of a scene is captured. The image can be captured using any imaging technique known in the art, including, without limitation, visible light imaging, infrared light imaging, ultraviolet light imaging, X-ray imaging, radiofrequency imaging, microwave imaging and ultrasound imaging. The imaged scene can be of any type, including, without limitation, an outdoor scene, an indoor scene, a nearby scene, a remote scene, an astronomical scene, an underwater scene, an intracorporeal scene (namely a scene that includes internal organs of a subject), an extracorporeal scene (namely a scene that includes external organs of a subject), and any combination thereof.

Alternatively, 11 can be skipped in which case an image is received as a stream of imaging data, as further detailed hereinabove.

In some embodiments of the present invention, the imaged is subjected to a preprocessing operation, such as, but not limited to, the preprocessing operation described in International Patent Application No. PCT/IL2008/001419, the contents of which are hereby incorporated by reference. This embodiment is particularly useful when the image is formed by a computerized tomography technique, e.g., CT of SPECT.

At 12 the image is optionally and preferably decomposed into a set of scaled images, each being characterized by a different image-scale. Alternatively, 12 can be skipped, in which case the set of scaled images is received by the method from an external source.

In various exemplary embodiments of the invention the set is an ordered set, wherein the kth element of the set is a blurred version of the k−1 element. In other words, the images in the set are ordered such that the resolution of the k−1 image is finer than the resolution of the kth image. The decomposition 12 can be done using any procedure known in the art.

A known operator for decomposing an image is referred to in the literature as “Reduce.” A representative example of a Reduce operator suitable for the preset embodiments is described in Burt and Adelson, 1983, “The Laplacian Pyramid as a Compact Image Code,” IEEE Transactions On Communications, vol. Com-31, No. 4, the contents of which are hereby incorporated by reference. When a Reduce operator is employed, the k+1 element of the set can be calculated based on the kth element, as follows I^(k+1)=Reduce(I^(k)). In some embodiments of the present invention the set of scaled images form an “Image Pyramid Representation” which is formed by successively applying a downsampling filter having a weighting function centered on the pixel itself. In some embodiments of the present invention the weighting function is a unimodal function, such as a function having a shape which is approximately a Gaussian. When the weighting function approximates a Gaussian, the Image Pyramid Representation is referred to as a Gaussian Pyramid. However, it is not intended to limit the scope of the present embodiments to unimodal or Gaussian weighting function. Other types of weighting function, such as, but not limited to, a triangular weighting function and a trimodal weighting function, are not excluded from the scope of the present invention.

In some embodiments of the present invention a scaled image is obtained by downsampling an image of a finer resolution. Thus, denoting the intensities of the kth scaled image by I^(k) (x, y), where the set of tuples (x, y) represents the picture-elements in the image, the intensities of the k+1 scaled image can be written as I^(k+1) (x, y)=I^(k)(ρ_(DS)x,ρ_(DS)y) where ρ_(DS) is a predetermined downsampling coefficient, and where I¹ (x, y) can be, for example, the original image, denoted I_(in)(x,y). In some embodiments, the decomposing is done by integrating the image I_(in)(x,y) with a kernel function, using a different spatial support for each resolution. A representative example of a kernel function for the kth scaled image is a Gaussian kernel,

${\frac{1}{\sqrt{\pi}\rho^{k}}{\exp \left( {- \frac{x^{2} + y^{2}}{\left( \rho^{k} \right)^{2}}} \right)}},$

where ρ is a predetermined parameter.

In some embodiments, the decomposing employs an edge-preserving smoothing filter, such as, but not limited to, a bilateral edge-preserving smoothing filter. A bilateral is a non-linear filter introduced by Tomasi and Manduchi (see “Bilateral filtering for gray and color images,” Proc. IEE Intl. Conf. on Computer Vision, Bombay, India, 1998), which is used for selective de-noising an images without blurring its edge. The bilateral filter takes into consideration both geometric distances in the spatial domain and similarities in the intensity domain. A bilateral filter typically features a convolution mask having weights which are modified as a function of intensity differences between a picture-element under consideration and its neighbors.

Denoting the edge-preserving smoothing filter by EPF, the kth scaled image of the present embodiments can have the form EPF(I^(k−1)). A representative example of a edge-preserving smoothing filter suitable for the present embodiments is found, for example, in U.S. Pat. Nos. 5,771,318, 7,146,059 and 7,199,793 the contents of which are hereby incorporated by reference.

As a representative example, which is not-intended to be considered as limiting, the following expression can be used as a bilateral filter:

$\begin{matrix} {{{EPF}\left( {I\left( \overset{\rightarrow}{r} \right)} \right)} = \frac{\int{\int_{image}{{G_{r}\left( {{\overset{\rightarrow}{r} - {\overset{\rightarrow}{r}}^{\prime}}}\  \right)}{G_{s}\left( {{{I\left( {\overset{\rightarrow}{r}}^{\prime} \right)} - {I\left( {\overset{\rightarrow}{r}}^{\prime} \right)}}} \right)}{I\left( {\overset{\rightarrow}{r}}^{\prime} \right)}{^{2}{\overset{\rightarrow}{r}}^{\prime}}}}}{\int{\int_{image}{{G_{r}\left( {{\overset{\rightarrow}{r} - {\overset{\rightarrow}{r}}^{\prime}}}\  \right)}{G_{s}\left( {{{I\left( {\overset{\rightarrow}{r}}^{\prime} \right)} - {I\left( {\overset{\rightarrow}{r}}^{\prime} \right)}}} \right)}{^{2}{\overset{\rightarrow}{r}}^{\prime}}}}}} & \left( {{EQ}.\mspace{14mu} 1} \right) \end{matrix}$

where the vectors {right arrow over (r)} and {right arrow over (r)}′ represent the coordinates of the picture-elements in the image, for example, when the image is defined over a Cartesian coordinate system, {right arrow over (r)}=(x, y) and {right arrow over (r)}′=(x′, y′); G_(r) and G_(s) are localized functions with finite supports; and the integration measure d²{right arrow over (r)}′ includes a Jacobian which corresponds to the coordinate system over which the image is defined, for example, for a Cartesian coordinate system d²{right arrow over (r)}′=dx′ dy′. While some of the embodiments below are described, for clarity of presentation, by means of a Cartesian coordinate system, it is to be understood that more detailed reference to a Cartesian coordinate system is not to be interpreted as limiting the scope of the invention in any way. Notice that EQ. 1 features localized functions both in the spatial domain and in the intensity domain. Specifically, the function G_(r) is centered at the coordinate {right arrow over (r)} and the function Gs is centered at the intensity I({right arrow over (r)}) that is associate with the coordinate {right arrow over (r)}.

The localized functions G_(r) and G_(s) can have any form provided it has a finite support. Representative examples including, without limitation, Gaussians, Lorenzians, modified Bessel functions. In some embodiments of the present invention both G_(r) and G_(s) are Gaussians, e.g.,

G _(s)(|r|)=exp(−r ²/σ_(s) ²); G _(r)(r)=exp(−r ²/σ_(r) ²),  (EQ. 2)

where σ_(s) and σ_(r) are radius parameters characterizing the local support of G_(s) and G_(r), respectively.

The decomposition 12 preferably features a set of image-specific filters, wherein for each scaled image an image-specific filter with different image-specific parameters is employed. For example, when EQs. 1 and 2 are employed, each scaled image is associated with a different set of radius parameters. The radius parameters used for obtaining the kth image I^(k) are denoted σ_(r) ^(k) and σ_(s) ^(k). Thus, the kth image I^(k) is preferably calculated using the expression I^(k)=EPF(I^(k−1)), wherein EPF features two localized functions G_(r) and G_(s) with two respective image-specific radius parameters σ_(r) ^(k) and σ_(s) ^(k).

The method optionally and preferably continues to 13 at which each of at least some of the scaled images is processed using an adaptation procedure featuring an image-specific effective saturation function.

The effective saturation function is “image-specific” in the sense that for each scaled image the procedures defined a specific effective saturation function which is typically different from the effective saturation function defined for any other scaled image in the set. In various exemplary embodiments of the invention the effective saturation function is applied for each picture-element of the scaled image being processed and can therefore be viewed as a processed scaled image. In other words, the returned values of the effective saturation function can be used as image intensities. The effective saturation function for the kth scaled image I^(k)(x,y) is denoted R^(k)(x,y), and is interchangeable referred to herein as the kth processed scaled image.

R^(k) is optionally and preferably a function of intensities in kth image as well of intensities in at least one scaled image of the set which is other than the kth scaled image. In various exemplary embodiments of the invention R^(k) is a function of intensities in at least the kth image and an image which is a blurred version (e.g., with a coarser resolution) of the kth image. For example, R^(k) can be a function of the I^(k) and I^(k+1). A typical expression for R^(k) is (for clarity of presentation, the spatial dependence of R^(k), I^(k) and I^(k+1) on the location (x, y) of the picture-element has been omitted):

$\begin{matrix} {R^{k} = \frac{R_{\max}I^{k}}{{\alpha \; I^{k}} + {\beta \; I^{k + 1}}}} & \left( {{EQ}.\mspace{14mu} 3} \right) \end{matrix}$

where, R_(max), α and β are coefficients, which can be constants or they can vary across the image and/or between images. For example, R_(max), α and β can each be set to 1, but other values are not excluded from the scope of the present invention.

In some embodiments, R^(k) is a function of a relative luminance q^(k) between the kth scaled image and the other scaled image (e.g., the k+1 scaled image). In various exemplary embodiments of the invention q^(k) is the relative luminance per picture-element, namely it has a specific value for each picture-element in the kth scaled image. In these embodiments, q^(k)=q^(k)(x,y), where the set of tuples (x, y) represents the picture-elements in the kth image. The relative luminance q^(k)(x,y) can optionally and preferably be expressed in terms of the ratio between intensities in the kth scaled image and intensities in the other scaled image. For example, q^(k)(x,y) can be defined as q^(k)(x,y)=ƒ(I^(k)(x,y)/I^(k+1)(x,y)), where ƒ is some function, preferably a monotonically increasing function, e.g., a linear function characterized by a positive slope. Thus, in some embodiments of the present invention q^(k)(x,y) is defined as a I^(k)(x,y)/I^(k+1)(x,y)+b, where a and b are parameters which are optionally constants. In some embodiments, a=1 and b=0, but other values are not excluded from the scope of the present invention.

The calculation of the relative luminance q^(k) optionally and preferably comprises some interpolation of the picture-elements in the coarser image as known in the art. For example q^(k) can be calculated as ƒ(I^(k)(x,y)/Expand(I^(k+1)(x,y))), where Expand is an interpolation operator that preferably matches the number of elements in I^(k+1) with the number of elements in I^(k) using an interpolation algorithm. In some embodiments of the present invention Expand is the reverse operator of the Reduce operator. A representative example of an Expand operator suitable for the present embodiments is described in Burt and Adelson, supra.

A typical form of R^(k) when expressed as a function of the relative luminance q^(k) is (for clarity of presentation, the spatial dependence of q^(k) and R^(k) has been omitted):

$\begin{matrix} {{R^{k}\left( q^{k} \right)} = {\frac{R_{\max}}{\alpha + \left( {M/q^{k}} \right)^{\gamma}} + B}} & \left( {{EQ}.\mspace{14mu} 4} \right) \end{matrix}$

where R_(max) and α are parameters already introduced above, M and γ are a modulation coefficient and an exponent, respectively, and B is an offset parameter. Each of M, γ and B can be a constant or some function of the intensity. In the simplest case, α, M and γ are set to 1, and B is set to zero, so that R^(k) is reduced to the form R^(k)=Rmax/(a+(q^(k))⁻¹). However, this need not necessarily be the case, since, for some applications, it may be desired to let γ and/or M and/or B be different from the above values and/or vary. Higher values of γ cause enhancement of the rate of change in R as a function of q, particularly at the vicinity of q=1, where there are small differences between the luminance of a specific location and its context. The effect of the exponent γ on R^(k) is exemplified in FIG. 2A, which are plots of R^(k) as a function of q^(k), for Rmax=M=1, B=0 and seven fixed values of γ: γ=0.5, γ=1, γ=2, γ=5, γ=7, γ=10 and γ=15. It is to be understood that these values are for illustrative purpose only and are not to be considered as limiting.

In some embodiments, the value of γ is specific to the scaled image. In these embodiments, the exponent used for the kth scaled image is denoted γ^(k). In some embodiments of the present invention γ^(k) is a decreasing function, e.g., a linear decreasing function, of k. Preferably, γ^(k) is positive for all values of k.

In some embodiments, for the coarse scales (high k) that reflect the illumination γ^(k) can be less than 1 so as to compress the high dynamic range of the illumination; at finer scales (low k) γ^(k) can be set to a value which is higher than 1. Alternatively, γ^(k) is above 1 for all resolution. In some embodiments, γ^(k) satisfies γ_(min)≦γ^(k)≦γ_(max), where γ_(min) and γ_(max) are predetermined parameters which are the same for all scales. In some specific embodiments of the present invention γ_(min) is from about 1 to about 3, e.g., about 2 and γ_(max) is from about 5 to about 9, e.g., about 7.

As a representative and non limiting example for a linear decrease of γ as a function of the resolution index k, γ can be decreased by Δγ for each integer increment of k, where Δγ is from about 0.1 to about 0.4 or from about 0.2 to about 0.3, e.g., about 0.25.

The modulation coefficient M can be viewed as a parameter which modulates the relative luminance q^(k). Formally, the expression q^(k)/M can be defined as an effective relative luminance, wherein higher values of M correspond to lower effective relative luminance and lower values of M correspond to higher effective relative luminance. The effect of the modulation coefficient M on R^(k) is exemplified in FIG. 2B, which are plots of R^(k) as a function of q^(k), for R_(max)=γ=1 and three fixed values of M: M=0.5, M=1 and M=2. It is to be understood that these values are for illustrative purpose only and are not to be considered as limiting. Generally, larger values of M suppress the value of R^(k). M can be a global constant or it can vary over the picture-elements of the scaled image being processed and/or across the scaled images in the set. When M varies over the picture-elements it is realized as a function of the coordinates, for example, M=M(x,y), and when M varies across the scaled images of the set, the method features a set {M^(k)} of coefficients each of which can be a constant coefficient or a function, e.g., M^(k)=M^(k)(x,y).

While the embodiments above are described with a particular emphasis to an effective relative luminance having the form expression q^(k)/M, it is to be understood that more detailed reference to such expression is not to be interpreted as limiting the scope of the invention in any way. Generally, an effective relative luminance {circumflex over (q)}^(k) can be obtained using any linear or non-linear modulation operation, in which case the effective saturation function can be written as:

$\begin{matrix} {{R^{k}\left( q^{k} \right)} = {\frac{R_{\max}}{\alpha + \left( {\hat{q}}^{k} \right)^{- \gamma}} + B}} & \left( {{{EQ}.\mspace{14mu} 4}A} \right) \end{matrix}$

In some embodiments of the present invention the value of the exponent γ and/or coefficient M is calculated based on image intensities in the scaled image.

For example, the image-specific exponent γ^(k) can be a function of a local contrast C^(k) within the scaled image I^(k). Preferably, γ^(k) decreases with C^(k). In some embodiments, γ^(k) decreases linearly with C^(k), and in some embodiments γ^(k) decreases non-linearly with C^(k). A preferred relation between γ^(k) and C^(k) is:

γ^(k) =f(C _(max))−C ^(k),  (EQ. 5)

where C_(max) is a constant parameter which, in some embodiments, is the maximal contrast over image I^(k), and f is some function. Representative examples of expressions suitable for the function f(C_(max)) including, without limitation, f(C_(max))=C_(max), and f(C_(max))=pC_(max), where p is a constant parameter which is preferably larger than 1.

An alternative expression for a linear relation between γ^(k) and C^(k) is:

γ^(k)=δ(1−C ^(k))  (EQ. 5A)

where δ is a contrast enhancement parameter. Typically, larger values of δ correspond to higher characteristic contrast of the final image.

Also contemplated is a non-linear relation between γ^(k) and C^(k), for example,

γ^(k) =N/(C ^(k))^(n),  (EQ. 6)

where N and n are positive constants, e.g., N=n=1.

The modulation operation executed for providing the effective relative luminance {circumflex over (q)}^(k) (for example, the value of M, in embodiments in which {circumflex over (q)}^(k)=q^(k)/M) is optionally and preferably selected such that the returned values of the effective saturation functions for two different exponents be approximately equal for a given scale. For example, consider the kth the effective saturation function R^(k). This function can be calculated more than once, using a different value for the exponent γ^(k) at each calculation. Without loss of generality, suppose that for a given scale k, two saturation functions R₀ ^(k) and R₁ ^(k) are calculated, as follows:

${{R_{0}^{k}\left( q^{k} \right)} = {\frac{R_{\max}}{\alpha + \left( {M/q^{k}} \right)^{\gamma_{0}^{k}}} + B}};$ $\; {{R_{1}^{k}\left( q^{k} \right)} = {\frac{R_{\max}}{\alpha + \left( {M/q^{k}} \right)^{\gamma_{1}^{k}}} + {B.}}}$

The exponents γ₀ ^(k) can have a fixed and predetermined (e.g., the same value for γ₀ ^(k) for all values of k), and γ₁ ^(k) can be selected according to the local contrast as further detailed hereinabove. In various exemplary embodiments of the invention γ₀ ^(k) is the lowest allowed value of the exponent, e.g., γ₀ ^(k)=γ_(min). In some embodiments of the present invention the effective relative luminance {circumflex over (q)}^(k) is selected such that R₀ ^(k)({circumflex over (q)}^(k))=R₁ ^(k)(q^(k)).

The local contrast C^(k) can be calculated from the intensity values of the picture-element in the respective scaled image using any known procedure for detecting or calculating local contrast. Representative techniques suitable for the present embodiments are found, for example, in U.S. Pat. Nos. 7,791,652, 7,929,739, 6,078,686 and 5,838,835 the contents of which are hereby incorporated by reference.

In some embodiments of the present invention the local contrast is calculated based on one or more intensity differences between scaled images in the set. Typically, the local contrast is calculated based on intensity differences between scaled images whose resolution is not lower than the resolution of the currently processed scaled image. A representative example for the local contrast C^(k) in these embodiments is:

$\begin{matrix} {{C^{k} = {\sum\limits_{i = 0}^{k}\; {{I^{i} - I^{i + 1}}}^{ɛ}}},} & \left( {{{EQ}.\mspace{14mu} 6}A} \right) \end{matrix}$

where ε is a local contrast exponent. Typical values for ε are from about 0.1 to about 1, e.g., about 0.3.

In some embodiments of the present invention the local contrast is calculated using a contrast-based adaptation procedure which can be constructed so as to mimic a mechanism of the human vision system known as a second order achromatic induction. The contrast-based adaptation procedure of the present embodiments is preferably as follows.

Firstly, the procedure mimics the transformation of a visual stimulus into a response of post retinal second order opponent receptive fields (SORF's). These SORF's may refer to cortical levels even though the receptive fields are not necessarily oriented. These SORF's have various spatial resolutions, in compliance with the diversity found in the human visual system. The number of different spatial resolutions employed by the procedure and the number of scaled images in the set can be the same or they can be different. In some embodiments of the present invention the number of different spatial resolutions employed by the contrast-based procedure is larger than the number of scaled images in the set.

Secondly, local and remote contrasts are calculated based on the multi scale SORF responses, and thirdly a contrast-contrast induction is employed. The contrast-contrast induction serves as a contrast gain control and is expressed by the adapted responses of the SORF cells.

In the human visual system, the SORF cells receive their input from the retinal ganglion cells through several processing layers. The retinal ganglion cells perform a (first order) adaptation and the SORF cells receive their responses after the adaptation. In the following description, the first order adaptation is not modeled for clarity of presentation, but the skilled artisan, provided with the information described herein would know how to employ first order adaptation, e.g., using the formalism of center and surround adaptation terms described above and/or the first order adaptation described in Barkan et al., (2008), “Computational adaptation model and its predictions for color induction of first and second orders,”, J. of vision 8(7) 27 1-26.

The SORF cells have an opponent type receptive field with a center-surround spatial structure. Thus, in various exemplary embodiments of the invention the method defines, for each picture-element 20 one or more regions in the vicinity of picture-element 20 (but not necessarily adjacent thereto). Typically, a surrounding region is defined. In some embodiments the method also defines a center region which comprises element 20 and picture elements immediately adjacent to element 20. Alternatively, the center region may be a single element region, hence comprising only element 20. This alternative, of course, coincide with the embodiment in which no center region is selected.

The concept of the center and surrounding regions may be better understood from the following example, with reference to FIGS. 3A-B. Thus, if the picture elements are arranged in a rectangular grid 30, the center region may be a single picture element (element 20), and the surround region may be picture elements 32 surrounding picture elements 20. Picture elements 34, surrounding picture elements 32 can be referred to as remote region.

In FIG. 3A, the surround region comprises eight picture-elements immediately surrounding (i.e., adjacent to) element 20, and the remote region comprises 40 picture-element forming the two layers surrounding those eight picture-elements. However, this need not necessarily be the case, since, for some applications, it may be desired to extend the surround region farther from those eight elements which immediately surround element 20. FIG. 3B, for example, illustrates an embodiment in which the surround region comprises 48 picture-element which form the first three layers surrounding element 20, and the remote region comprises 176 picture-element which form the four layers surrounding those 48 elements. Also contemplated are embodiments in which the center region comprises more picture-element, e.g., an arrangement of 2×2 or 3×3 picture-elements. Other definitions for the center, surrounding and remote regions are not excluded from the present invention, both for a rectangular grid or for any other arrangement according to which the picture elements of the scaled image are received.

Once the region(s) are defined, the intensities of the picture elements in each region are preferably used for calculating, for each region, an overall regional intensity. The overall intensity can be calculated as a convolution of the intensity of the picture elements in each region with the respective regional spatial profile. For the center region, this convolution is preferably realized by the following equations:

$\begin{matrix} {L_{cen}^{k} = {\underset{cen}{\int\int}{I^{k}\left( {x,y} \right)}{f_{c}^{k}\left( {{x - x_{0}},{y - y_{0}}} \right)}{x}{y}}} & \left( {{EQ}.\mspace{14mu} 7} \right) \end{matrix}$

where (x₀, y₀) is the location of the center of the center region, ƒ_(c) ^(k) is the spatial profile for the center region at the kth scaled image, and the integration is over all the picture-elements in the center region. Without loss of generality, (x₀, y₀) can be set to (0, 0). In the following description, the reference to x₀ and y₀ is therefore omitted. The center spatial profile is preferably a spatial decaying function. Examples for the specific form of the spatial profile of the center region include, but are not limited to, a Gaussian, an exponent, a Lorenzian, a modified Bessel function and a power-decaying function.

In some embodiments of the present invention, ƒ_(c) ^(k) is given by:

$\begin{matrix} {{{{f_{c}^{k}\left( {x,y} \right)} = \frac{\exp \left( {{- x^{2}} + {y^{2}/\rho_{cen}^{k\; 2}}} \right)}{\pi \cdot \rho_{cen}^{k}}};}{x,{y \in {center\_ area}}}} & \left( {{EQ}.\mspace{14mu} 8} \right) \end{matrix}$

where ρ_(cen) represents the radius of the center region.

For the surround region, the convolution is similarly defined:

$\begin{matrix} {L_{srnd}^{k} = {\underset{srnd}{\int\int}{I^{k}\left( {x,y} \right)}{f_{s}^{k}\left( {x,y} \right)}{x}{y}}} & \left( {{EQ}.\mspace{14mu} 9} \right) \end{matrix}$

where ƒ_(c) ^(k) is the spatial profile for the surround region at the kth scaled image, and the integration is over all the picture-elements in the surround region. Examples for the specific form of the spatial profile of the surround region include, but are not limited to, a Gaussian, an exponent, a Lorenzian, a modified Bessel function and a power-decaying function.

In some embodiments of the present invention, ƒ_(s) ^(k) is given by:

$\begin{matrix} {{{{f_{s}^{k}\left( {x,y} \right)} = \frac{\exp \left( {{- x^{2}} + {y^{2}/\rho_{sur}^{k\; 2}}} \right)}{\pi \cdot \rho_{sur}^{k}}};}{x,{y \in {surround\_ area}}}} & \left( {{EQ}.\mspace{14mu} 10} \right) \end{matrix}$

where ρ_(sur) (also referred to below as ρ_(sur)) represents the radius of the surrounding region. The total weight of ƒ_(c) and ƒ_(s) is typically 1.

The response of the SORF L_(sorf) can be calculated based on the difference between the intensity value of picture-element(s) in the center region and intensity values of nearby picture-elements in the surround region. For example,

L _(sorf) ^(k) =L _(cen) ^(k) −L _(srnd) ^(k)  (EQ. 11)

The calculations of the SORF response can be alternatively derived from an achromatic double-opponent receptive field structure, where the center and the surrounding regions contained opponent sub-units described for example, in Barkan et al. 2008, supra.

L^(k) _(sorf) is optionally and preferably used for calculating a contrast term, for example, by averaging the value of L^(k) _(sorf) over several scaled images or, more preferably, over all the scaled images in the set. A representative examples for the contrast C is given by:

$\begin{matrix} {{C^{k}\left( {x,y} \right)} = {\sum\limits_{k^{\prime}}\; \frac{\int{\int{{{L_{sorf}^{k^{\prime}}\left( {x^{\prime},y^{\prime}} \right)}}^{\delta}{W^{k^{\prime}}\left( {{x^{\prime} - x},{y^{\prime} - y}} \right)}{x}{y}}}}{\int{\int{{W^{k^{\prime}}\left( {x,y} \right)}{x}{y}}}}}} & \left( {{EQ}.\mspace{14mu} 12} \right) \end{matrix}$

where the integrations are preferably over a region encompassing the scales that are defined in the contrast region, δ is a parameter, and W^(k)(x,y) are weight functions which are preferably localized at (x,y), with some predetermined support. The δ can be any integer or non-integer positive number. Typically, the δ parameter satisfies δ≧1, e.g., δ=1 or δ=2 or δ=3. A representative example of W^(k)(x,y) is a two-dimensional Gaussian:

$\begin{matrix} {{{W^{k}\left( {x,y} \right)} = {\exp\left( {- \frac{x^{2} + y^{2}}{\rho_{local}^{2}}} \right)}},} & \left( {{EQ}.\mspace{14mu} 13} \right) \end{matrix}$

where ρ_(local) is a radius parameter representing the size of the support.

Once calculated, C^(k) can be used for calculating the exponent γ^(k), by means of a linear or non-linear relation, as further detailed hereinabove.

The above contrast-based procedure can also be used for calculating the modulation coefficient M described above (see EQ. 4). In some embodiments, M is a decreasing function of L^(k) _(sorf). In other words M has higher values when the local contrast is low, and lower values when the local contrast is high. For example, the coefficient M^(k) for the kth scaled image can be a linear decreasing function of L_(sorf), e.g., M=f(I^(k) _(max))−L^(k) _(sorf), where I^(k) _(max) is the maximal intensity over the kth scaled image, and f is some function, e.g., f(I^(k) _(max))=I^(k) _(max) or f(I^(k) _(max))=mI^(k) _(max) where m is a positive parameter. Alternatively, M can be a non-linear decreasing function of L_(sorf). A representative example is the expression M=1/L^(k) _(sorf) or the like. Other expressions are not excluded from the scope of the present invention.

When EQ. 3 is employed, the contrast-based procedure can be used for calculating the β coefficient. In some embodiments, the β coefficient is a decreasing function of L^(k) _(sorf). In other words β has higher values when the local contrast is low, and lower values when the local contrast is high. For example, the coefficient β^(k) for the kth scaled image can be a linear decreasing function of L_(sorf), e.g., β^(k)=f(I^(k) _(max))−L^(k) _(sorf), where I^(k) _(max) is the maximal intensity over the kth scaled image, and f is some function, e.g., f(I^(k) _(max))=I^(k) _(max) or f(I^(k) _(max))=mI^(k) _(max) where m is a positive parameter. Alternatively, M can be a non-linear decreasing function of L_(sorf). A representative example is the expression β^(k)=1/L^(k) _(sorf) or the like. Other expressions are not excluded from the scope of the present invention.

Referring again to FIG. 1 the method can continue to 14 at which at least some of the processed scaled images are combined to provide a combined image. The images are preferably combined by multiplication. For example a combined image I_(combined) can be obtained using the following equation:

$\begin{matrix} {{I_{combined} = {\prod\limits_{k}\; I^{k}}},} & \left( {{EQ}.\mspace{14mu} 14} \right) \end{matrix}$

where the multiplication is over some or all the scaled images in the set.

The combined image can also be obtained from the relative luminance of the effective saturation functions, rather than from the effective saturation functions themselves. These embodiments are particularly useful when the relative luminance levels are modulated. For example, the combined image can be obtained by multiplying the effective relative luminance values of at least a few, more preferably, all the effective saturation functions:

$\begin{matrix} {{I_{combined} = {\prod\limits_{k}\; {\hat{q}}^{k}}},} & \left( {{{EQ}.\mspace{14mu} 14}A} \right) \end{matrix}$

where the multiplication is over some or all the scaled images in the set.

The calculation of the combined image optionally and preferably comprises some interpolation of the picture-elements in the coarser image as known in the art. For example, the following iterative process can be employed:

I ^(n) ={circumflex over (q)} ^(n) {circumflex over (q)} ^(n+1)

{circumflex over (q)} ^(n+1)=Expand(I ^(n+1)),  (EQ. 14B)

where Expand is an interpolation operator as further detailed hereinabove. Once the iterative process (14B) is completed, the combined image can be defined as image obtained at the last iterative step, e.g., I⁰.

The method optionally continues to 15 at which the combined image is normalized. A normalization procedure suitable for the present embodiments is the so called log-mean normalization, wherein the logarithm of the intensity of each picture-element is first normalized by the average of the logarithms of intensities and then exponentiated. Formally, this procedure can be written as:

$\begin{matrix} {\left. I_{combined}\rightarrow{\exp \left\lbrack {{\log \left( I_{combined} \right)}\frac{TL}{< {\log \left( I_{combined} \right)} >}} \right\rbrack} \right.,} & \left( {{EQ}.\mspace{14mu} 15} \right) \end{matrix}$

where <log(I_(combined))> is the average of the logarithms of intensities calculated over the combined image, and TL is a constant parameter which represents the log-mean of the normalized image. The method optionally and preferably proceeds to 16 at which the combined and/or normalized image is transmitted to a computer readable medium, from which it can be displayed or printed as desired.

In various exemplary embodiments of the invention the characteristic dynamic range of the combined and/or normalized image is lower than the characteristic dynamic range of the original image. For example, when the characteristic dynamic range of the original image spans over 4 or more orders of magnitudes, the characteristic dynamic range of the combined image is three or two orders of magnitudes. In various exemplary embodiments of the invention the characteristic dynamic range of the combined image is sufficient to allow all the intensities of the image to be displayed on a display device. For example, in some embodiments of the present invention the combined image comprises no more than 256 different intensities.

The method ends at 17.

FIG. 4 is a schematic illustration of a system 40 for processing an image, according to some embodiments of the present invention. System 40 comprises a data processor 42 having a computation module which comprises at least an image decomposer 44 configured for decomposing the image into a set of scaled images, a scaled image processing module 46 configured for processing each scaled image of set to provide a processed scaled image, and an image combiner 48 configured for combining at least some of the processed scaled images to provide a combined image. In various exemplary embodiments of the invention the computation module of data processor 42 is configured for executing at least some of the operations described above with respect to method 10-17.

FIG. 5 is a schematic illustration of an imaging system 50, according to some embodiments of the present invention. Imaging system 50 comprises an image capturing system 52 and system 40. Image capturing system 52 can be of any type, including, without limitation, a digital camera, a video camera, a CMOS digital camera, an infrared camera, an X-ray camera, a scanner, a microwave imaging, a computerized tomography scanner, a magnetic resonance imaging scanner, a mammography scanner, an ultrasonic scanner, an impedance imaging system, an endoscopic imaging device, a radio telescope, a digital telescope, a digital microscope and a system for translating an analog image to a digital image.

As used herein the term “about” refers to ±10%.

The word “exemplary” is used herein to mean “serving as an example, instance or illustration.” Any embodiment described as “exemplary” is not necessarily to be construed as preferred or advantageous over other embodiments and/or to exclude the incorporation of features from other embodiments.

The word “optionally” is used herein to mean “is provided in some embodiments and not provided in other embodiments.” Any particular embodiment of the invention may include a plurality of “optional” features unless such features conflict.

The terms “comprises”, “comprising”, “includes”, “including”, “having” and their conjugates mean “including but not limited to”.

The term “consisting of” means “including and limited to”.

The term “consisting essentially of” means that the composition, method or structure may include additional ingredients, steps and/or parts, but only if the additional ingredients, steps and/or parts do not materially alter the basic and novel characteristics of the claimed composition, method or structure.

As used herein, the singular form “a”, “an” and “the” include plural references unless the context clearly dictates otherwise. For example, the term “a compound” or “at least one compound” may include a plurality of compounds, including mixtures thereof.

Throughout this application, various embodiments of this invention may be presented in a range format. It should be understood that the description in range format is merely for convenience and brevity and should not be construed as an inflexible limitation on the scope of the invention. Accordingly, the description of a range should be considered to have specifically disclosed all the possible subranges as well as individual numerical values within that range. For example, description of a range such as from 1 to 6 should be considered to have specifically disclosed subranges such as from 1 to 3, from 1 to 4, from 1 to 5, from 2 to 4, from 2 to 6, from 3 to 6 etc., as well as individual numbers within that range, for example, 1, 2, 3, 4, 5, and 6. This applies regardless of the breadth of the range.

Whenever a numerical range is indicated herein, it is meant to include any cited numeral (fractional or integral) within the indicated range. The phrases “ranging/ranges between” a first indicate number and a second indicate number and “ranging/ranges from” a first indicate number “to” a second indicate number are used herein interchangeably and are meant to include the first and second indicated numbers and all the fractional and integral numerals therebetween.

It is appreciated that certain features of the invention, which are, for clarity, described in the context of separate embodiments, may also be provided in combination in a single embodiment. Conversely, various features of the invention, which are, for brevity, described in the context of a single embodiment, may also be provided separately or in any suitable subcombination or as suitable in any other described embodiment of the invention. Certain features described in the context of various embodiments are not to be considered essential features of those embodiments, unless the embodiment is inoperative without those elements.

Various embodiments and aspects of the present invention as delineated hereinabove and as claimed in the claims section below find experimental support in the following examples.

EXAMPLES

Reference is now made to the following examples, which together with the above descriptions illustrate some embodiments of the invention in a non limiting fashion.

Example 1 Thermal Images

The technique of the present embodiments was employed to process thermal images acquired using an infrared camera. FIGS. 6A, 7A, 8A, 9A, 10A and 11A show the original images, before processing. Each image was processed according to some embodiments of the present invention as described in the flowchart of FIG. 1. The image was decomposed into 3 scaled images each of which was processed using EQs. 1, 2, 4, 5 and 7-14. The modulation coefficient was set to 1.

For the contrast-based adaptation procedure, up to six possible resolutions were employed. The application of the specific resolution was done such that the coarse resolution was applied to the overall scales for the whole image. In other word, fine resolution in the SORF were not included in the coarse general scales. Table 1 specifies the radii of the center and surround regions which were used for each resolution k.

TABLE 1 k radius of the center region radius of the surround region 1 1 3 2 3 9 3 5 15 4 7 12 5 9 27 6 11 33

FIGS. 6B, 7B, 8B, 9B, 10B and 11B show the images after processing and combining. As shown, the technique of the present embodiments allows distinguishing between image features which were undistinguishable before the processing.

Example 2 HDR Images

Embodiments of the present invention were applied to High Dynamic Range images in an RGBE format. The original HDR images were obtained through the courtesy of Michael Werman, Erik Reinhard, Greg Ward, SpheronVR AG, Munsell Color Science Laboratory, and Paul Debevec's website.

Achromatic intensities were extracted from the polychromatic data of the images. This was performed by transforming each pixel in the RGBE image to CIE XYZ using the D65 sRGB transform matrix [IEC 61966-2-1:1999]. The achromatic intensity of each pixel was defined as the Y value of the pixel. Further transformation from the CIE XYZ space to CIE xyz was performed for each pixel, and the x,z, values were applied to the new achromatic intensities yielded according to various exemplary embodiments of the present invention.

Each image was processed according to some embodiments of the present invention as described in the flowchart of FIG. 1. The image was decomposed into 4 scaled images each of which was processed using EQs. 1, 2, 4, 5 and 7-14. The modulation coefficient was set to 1. The parameters used in the contrast-based adaptation procedure were as detailed in Example 1, above.

The dynamic ranges of original images are larger than the maximal displayable dynamic range of a conventional display device or a conventional printer and are therefore not shown. The combined and normalized processed scaled images are shown in FIGS. 12A-D.

The present example demonstrated the ability of embodiments of the present invention to perform automatic high dynamic range compression. The difference between the dynamic ranges of the original and processed image was up to about 10¹⁰ levels of intensity. The results demonstrate a significant compression while preserving, and even slightly enhancing the details in the images, both in the bright and dark zones. The technique has been applied for a large number of images. Although most of the experiments were performed using the same set of parameters, the dynamic compression was successful in all processed images. Yet, the technique of the present embodiments can be applied to different extents by assigning different values to the parameters.

Example 3 Optimization Considerations

A multi-resolution representation of an image I, referred to below as “the artist” image was produced according to some embodiments of the present invention using a Reduce operation featuring Gaussian weights, thereby forming a Gaussian pyramid. The process was applied as follows:

B _(n)=Reduce(B _(n−1)),  (EQ. 16)

where the finest resolution B₀ was set to be the original image:

B ₀ =I.  (EQ. 17)

The obtained Gaussian pyramid for a set of six scaled images I₀, . . . , I₅ is shown in FIGS. 13A-F.

A relative luminance q^(n) was then calculated according to the relation:

q ^(n) =I ^(n) /I ^(n+1),  (EQ. 18)

where I^(n) is the nth scaled image and I^(n+1) is an interpolated version of the n+1 scaled image obtained using the Expand operator:

I ^(n) =B _(n)

I ^(n+1)=Expand(B _(n+1))  (EQ. 19)

The calculation of relative luminance resulted in a luminance pyramid corresponding to the relative luminance levels q⁰, . . . , q⁴, shown in FIGS. 14A-E.

Once the luminance pyramid was obtained, a local contrast C^(k) was calculated for each luminance level q. The local contrast was calculated according to EQ. 6A above with ε=0.3. This resulted in a local contrast pyramid corresponding to the local contrast levels C⁰, . . . , C⁴, shown in FIGS. 15A-E.

Each of the local contrast levels C⁰, . . . , C⁴ was used for calculating a image-specific exponent γ according to EQ. 5A, thus providing a set of exponents γ⁰, . . . , γ⁴. The corresponding representation pyramid is shown in FIGS. 16A-E. Thereafter, EQ. 4 was employed for calculating an effective saturation function R^(n) for each luminance level q^(n), and each exponent γ^(n). This resulted in a saturation pyramid corresponding to the functions R⁰, . . . , R⁴, shown in FIGS. 17A-E.

The effective luminance levels were then combined according to EQ. 14B, to provide a combined image I_(combined). The original image I and the combined image I_(combined) are shown in FIGS. 18A and 18B respectively.

Multi resolutions algorithms tend to suffer from halo artifacts that appear around sharp edges. The present inventors contemplate both embodiments in which a bilateral filter is used and alternative embodiments in which a bilateral filter is not used. In the latter embodiments, considerations regarding the coarse contrast of the image are preferably made. In various exemplary embodiments of the invention the processing technique applies one or more smoothing filter, such as, but not limited to, Gaussian filter and performs the enhancement adaptively, as further detailed hereinabove. For areas in the image where the contrast is high, the contrast is not enhanced and may even be decreases.

It was found by the present inventors that for images that are relatively dark, the use of global gain may improve the quality of the final image. Thus, in some embodiments of the present invention, a global gain operation is applied in addition to the local contrast enhancement. This can be achieved in any technique known in the art. In some embodiments, all of the image pixels are raised to the same constant power, which is optionally and preferably less than 1:

I→I ^(p)  (EQ. 20)

In various exemplary embodiments of the invention the gain exponent p is selected using an optimization procedure. For example, a set {p_(j)} of candidate values of the gain exponent can be selected and a score can be assigned to each candidate gain exponent p_(j) of the set. The gain exponent is then selected from the candidates responsively to the assigned scores. Typically, the gain exponent is the candidate having the highest score. Formally, denoting the score assigned to the jth candidate gain exponent by S(I^(p) ^(j) ), the gain exponent can be calculated using the operation:

$\begin{matrix} {p = {\underset{p_{j}}{\arg \; \max}\left( {S\left( I^{p_{j}} \right)} \right)}} & \left( {{EQ}.\mspace{14mu} 21} \right) \end{matrix}$

A score suitable for the present embodiments is characteristic contrast of at least one of, more preferably at least some of, most preferably all, the scaled images in the set. Such characteristic contrast is referred to herein as a “Contrast Measure”.

In some embodiments, the Contrast Measure is a local-global contrast. A representative example of a technique for calculating a local-global contrast suitable for the present embodiments is found in Rizzi et al., “A Modified Algorithm for Perceived Contrast Measure in Digital Images,” CGIV 2008 and MCS′08 Final Program and Proceedings, the contents of which are hereby incorporated by reference. For example, the Contrast Measure can be calculated by summing the averages of the pyramid levels.

$\begin{matrix} {{{Contrast}\mspace{14mu} {{Measure}(I)}} = {\sum\limits_{n}\; {{Average}\left( {I_{n} - I_{n - 1}} \right)}}} & \left( {{EQ}.\mspace{14mu} 22} \right) \end{matrix}$

The summation in EQ. 22 is optionally and preferably performed over all the elements in the set. EQ. 22 ensures that an image with high contrast yields higher values for the contrast measure, wherein an image with low contrast yields lower values for the contrast measure.

When the score S is enacted by the function Contrast Measure, EQ. 22 is applied for each candidate p_(j), and the gain exponent p is preferably selected using the operation:

$\begin{matrix} {p = {\underset{p_{j}}{\arg \; \max}{\left( {{Contrast}\mspace{14mu} {{Measure}\left( I^{p_{j}} \right)}} \right).}}} & \left( {{EQ}.\mspace{14mu} 23} \right) \end{matrix}$

The size of the set (namely, the number of scaled images in the set which is the number of different resolutions employed) can be selected by the size of the image or by the amount of information in the image.

When the size of the set is selected based on the size of the image, it is optionally and preferably selected such that the coarsest resolution is of a predetermined size, for example a 64×64 resolution or a 32×32 resolution or a 16×16 resolution. Selecting the size of the set base on the size of the image is particularly useful when a relatively small portion of the input image is occupied by background.

When the size of the set is selected based on the amount of information in the image, the size of the set is preferably determined during the buildup of the set. In these embodiments, the amount of information that is being added by each resolution is determined. Once the amount of added information is below a predetermined threshold, the set buildup is terminated. The amount of added information can be calculated, for example, by counting the number of identifiable features (e.g., edges, distinguishable regions) in each scaled image. Selecting the size of the set base on the size of the image is particularly useful when a relatively large portion of the input image is occupied by background, and is generally preferred from the standpoint of computer resources.

Although the invention has been described in conjunction with specific embodiments thereof, it is evident that many alternatives, modifications and variations will be apparent to those skilled in the art. Accordingly, it is intended to embrace all such alternatives, modifications and variations that fall within the spirit and broad scope of the appended claims.

All publications, patents and patent applications mentioned in this specification are herein incorporated in their entirety by reference into the specification, to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated herein by reference. In addition, citation or identification of any reference in this application shall not be construed as an admission that such reference is available as prior art to the present invention. To the extent that section headings are used, they should not be construed as necessarily limiting. 

What is claimed is:
 1. A method of processing an image, comprising: obtaining an image decomposed into a set of scaled images, each being characterized by a different image-scale; for each of at least some scaled images of said set, calculating a local contrast within said scaled image; processing each scaled image of said set using a function of said local contrast, thereby providing a processed scaled image; combining at least some of the processed scaled images to provide a combined image; and outputting said combined image to a non-transitory computer readable medium.
 2. The method of claim 1, wherein said obtaining comprising receiving the image and decomposing the image into said set of scaled images.
 3. The method of claim 2, wherein said decomposing comprises selecting a size of said set based on a size of the image.
 4. The method of claim 2, wherein said decomposing comprises determining an amount of information in each scaled image being formed, and ceasing said decomposing when said amount of information is below a predetermined threshold.
 5. The method according to claim 1, wherein a characteristic dynamic range of said combined image is lower than a characteristic dynamic range of the original image.
 6. The method according to claim 1, further comprising calculating a relative luminance between said scaled image and another scaled image of said set, using intensities in said scaled image and intensities in said another scaled image, wherein said function of said local contrast is also a function of said relative luminance.
 7. The method according to claim 6, wherein said processing comprises modulating each relative luminance to provide a plurality of modulated relative luminance levels, wherein said combining comprises combining said modulated relative luminance levels.
 8. The method according to claim 6, wherein said set is an ordered set and wherein said relative luminance is expressed as function of a ratio between said intensities in said scaled image and said intensities in said other scaled image.
 9. The method according to claim 1, wherein said function of said local contrast comprises an exponent, which is a decreasing function of said local contrast.
 10. The method according to claim 9, wherein said decreasing function of said local contrast comprises a linear decreasing function of said local contrast.
 11. The method according to claim 1, wherein said local contrast is calculated using an adaptation procedure employed for each picture-element of said scaled image.
 12. The method according to claim 11, wherein said adaptation procedure calculates said local contrast based on a difference between a second order opponent receptive field function calculated for said picture-element and a second order opponent receptive field function calculated for nearby picture-elements.
 13. The method according to claim 1, wherein said function of said local contrast comprises a modulation function having higher values when said local contrast is low, and lower values when said local contrast is high.
 14. The method according to claim 1, wherein the image is of at least one type selected from the group consisting of a visible light image, an HDR image, a stills image, a video image, an X-ray image, an infrared image, a thermal image, a ultraviolet image, a computerized tomography (CT) image, a mammography image, a Roentgen image, a positron emission tomography (PET) image, a magnetic resonance image, an ultrasound images, an impedance image, an elastography image, and a single photon emission computed tomography (SPECT) image.
 15. A method of capturing and displaying an image, comprising capturing an image of a scene and processing the image using the method according to claim
 1. 16. The method of claim 15, wherein said capturing the image comprises recording radiation selected from the group consisting of visible light, infrared light, ultraviolet light, X-ray radiation, radiofrequency radiation, microwave radiation and ultrasound radiation.
 17. A non-transitory computer software product, comprising a non-transitory computer-readable medium in which program instructions are stored, which instructions, when read by a computer, cause the computer to obtain an image decomposed into a set of scaled images, each being characterized by a different image-scale; to calculate, for each of at least some scaled images of said set, a local contrast within said scaled image, to process each scaled image of said set using a function of said local contrast, thereby providing a processed scaled image, and to combine at least some of the processed scaled images to provide a combined image.
 18. An image processing system, comprising: an electronic input receiving an image decomposed into a set of scaled images, each being characterized by a different image-scale; a scaled image processing module configured for calculating, for each of at least some scaled images of said set, a local contrast within said scaled image, and for processing each scaled image of said set using a function of said local contrast, thereby to provide a processed scaled image; and an image combiner configured for combining at least some of the processed scaled images to provide a combined image.
 19. An imaging system, comprising an image capturing system and the system according to claim
 18. 20. The imaging system of claim 19, wherein said capturing system is selected from the group consisting of a digital camera, a video camera, a CMOS digital camera, an infrared camera, a thermography device, an X-ray camera, a scanner, a microwave imaging, a computerized tomography scanner, a single photon emission computed tomography device, a positron emission tomography device, a magnetic resonance imaging scanner, a mammography scanner, an ultrasonic scanner, an impedance imaging system, an endoscopic imaging device, an elastography device, a radio telescope, a digital telescope, a digital microscope and a system for translating an analog image to a digital image. 